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Abstract 

A quantum spin chain is identified by the the labels of a vector 
state of an irreducible representation of Wq_>o(s^2)- The intensity of 
the one-spin flip is assumed to depend from the variation of the la- 
bels. The rank ordered plot of the numerically computed, averaged 
in time, transition probabilities is nicely fitted by a Yule distribution, 
which is the observed distribution of the ranked short oligonucleotides 
frequency in DNA 
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1 Introduction 



Quantum spin chains are extremely important tools to understand various 
physical situations and, in some cases, provide completely soluble models. 
An interesting field of applications of these models is the theory of molecular 
biological evolution. Since 1986, when Leuthciusser jTj put a correspondence 
between the Eigen model of evolution [2j and a two-dimensional Ising model, 
many articles have been written representing biological systems as spin mod- 
els. Recently in [3] it has been shown that the parallel mutation-selection 
model can be put in correspondence with the hamiltonian of an Ising quan- 
tum chain and in jl] the Eigen model has been mapped into the hamiltonian 
of one-dimensional quantum spin chains. In this approach the genetic se- 
quence is specified by a sequence of spin values ±1. DNA is build up a 
sequence of a four basis or nucleotides which are usually identified by their 
letter: C, G, T, A (T being replaced by U in RNA), C and U (G and A) 
belonging to the purine family, denoted by R (respectively to the pyrimidine 
family, denoted by Y) . Therefore in the case of genome sequences each point 
in the sequence should be identified by an element of a four letter alphabet. 
As a simplification one identifies each element according to the purine or 
pyrimidine nature, reducing to a binary value, see |p] for a four-state quan- 
tum chain approach. The main aim of the works using this approach, see 
El 13 E] 5 is to find, in different landscapes, the mean "fitness" and the "bi- 
ological surplus", in the framework of biological population evolution. As 
standard assumption, the strength of the mutation is assumed to depend 
from the Hamming distance between two sequences, i.e. the number of sites 
with different labels. Moreover usually it is assumed that the mutation ma- 
trix elements are vanishing for Hamming distances larger than 1, i.e. for 
more than one nucleotide changes. The Hamming distance assumption is 
clearly unrealistic in the domain of genetic mutations. On the other side 
Martindale and Konopka P have remarked that the ranked short (ranging 
from 3 to 10 nucleotides) oligonucleotide frequencies, in both coding and 
non-coding region of DNA, follow a Yule distribution. We recall that a Yule 
distribution is given by / = an^b"', where n is the rank and a, k < and 
b are 3 real parameters. In this paper we propose a quantum spin model in 
which the intensity of the transition matrix depends in some way from the 
whole distribution of the nucleotides in the sequence. At present we assume 
that the transition matrix does not vanish only for total spin flip equal ±1, 
induced by the action of a single step operator, which generally is equivalent 
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to one nucleotide change. The model, which can appear unphysical if applied 
to a quantum spin chain, should be considered on the light of the previous 
remarks on the application to the biological evolution and our aim is to look 
for solutions which can reproduce the observed oligonucleotides distribution. 



A sequence of N ordered nucleotides, characterized only by the purine or 
pyrimidine character, can be represented as a vector belonging to the N- 
fold tensor product of the fundamental irreducible representation (irrep.) 
(labelled by J = 1/2) of Uq^o{sl2) which is usually called crystal basis 
representation. This parametrization allows to represent, in a simple way, the 
mutation of a sequence as a transition between states, which can be subjected 
to selection rules and whose strength depends from the two concerned states. 
So we identify a N-nucleotidic sequence as a state 



where J labels the irrep. which the state belongs to, J3 is the value 
of the 3rd diagonal generator of Uq^o{sl2) (2J3 = ri/j — ny, being the 
number of x elements in the sequence) and J* (2 < i < — 1) are N — 2 
labels needed to remove the degeneracy of the irreps. in the tensor product 
in order to completely identify the state and which can be seen as identifying 
the irrep. which the sequence truncated to the i-th element belongs to. We 
introduce a scalar product, such that 



As an example, we can consider a trinucleotidic string (A^ = 3) and label the 
eight different spin chains in the following way(| J3, J^, J^), R=\ =1, Y = 



2 Mutations and Crystal basis 



J) — I <^3; , . . . , JV • • , J"^) 
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Flipping the total spin by ±1 (AJ3 = ±1) can induce a transition to a state 
belonging or not belonging to the irrep. of the original state. One can easily 
realize that to identify a nucleotidic sequence as a state of an irrep. requires 
to fix the number of RY "contracted couples" occurring in the considered 
sequence (contraction should be understood in the same sense of contraction 
of creation-annihilation operators in the Wick expansion) . Therefore flipping 
a spin implies or the creation or the deletion of a RY contracted couple, 
corresponding respectively to a variation of -1 o +1 of the value of the 
and, possibly, of some others J* (2 < i < — 1), or to leave unmodified 
the number of contracted couples (so that AJ^ = 0, but A J' 7^ for some 
values of i). Note that alternatively one can identify 1/2 = (C, G) and 
— 1/2 = {T,A). Below we give phenomenological arguments for our choice. 

3 Transition operators 

Let us consider a N-nucleotidic string and classify the different transitions 
on the string labels J3, J^, . . . , J^. We can distinguish different string con- 
figurations around the i-th position, so that a single nucleotide mutation in 
i-th position can correspond to different variations in the string labels. We 
call left {right) side free the nucleotides on the left (right) of i-th position 
and not contracted with another one on the same side. Let Ri be the initial 
(before mutation) number of the left side free purines and the initial num- 
ber of the right side free pyrimidines. We want to count the total number of 
contracted KY couples (before and after mutation) in the string, so we call 
Rin {Rfi) the number, in the initial (final) state, of R preceding some Y and 
not contracted with any Y on their side. In the same way, with Y^^ (V/j) we 
refer to the number of Y following some it! and not contracted with any it! 
on their side. If a i? ^ y mutation (AJ3 = — 1) occurs in i-th position, then 
ii!i„ ^ Rfi + 1 and Yin = Yji - 1, where Rin = Ri + 1 and Yn = Yr. Now 
we can write the transition part of the hamiltonian, for the different possible 
initial configuration of the string. The transitions inducing operators are 
built by means of J-,Ai, and their adjoint operators, as below defined. 

• If Ri = Yr we distinguish two subcases: 
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1. Rl = Yr^Q 

N~l N 

^1 = E E "i' (^«.^'^- + ^+ aU) (3) 

i=2 k=i+l 

2. i?; = r,, = 

i/2 = «2(J- + J+) (4) 

• If Ri > Yr we distinguish two subcases: 

1. Yr = Q 

N 

H, = Y,a\{AJ^ + J^A^^ (5) 

1=2 

2. Yr ^ 

Af-1 

• If Ri < Yr we distinguish two subcases: 

1. Ri = 

N 

H, = J2< (J-Al + ^r. J+) (7) 
m=2 

2. Ri^O 

N-2 N-1 

^6 = E E iA,kJ-Ai^, + A^A.+M (8) 

i=2 k=i+l 

where J+ and J_ are the step operators defined by Kashiwara |TU|, acting 
on an irreducible representation with highest weight J^, i.e. inducing the 
transitions AJj = 0, Vz 7^ A^, 

Ai^k \ J) = \ J3, J'^ , , J'' ^ — 1, .., J* — 1, J* ^,..,J^) 

(2 < i < - 1 i + 1 < A; < iV) (9) 

A,, I J) = I J3,J'^-1,...,J*-1,J*"\...,J') 

{2<i<N) (10) 
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Therefore is the operator which increase by 1 the value of J', for k — l< 
I < i. A few words to comment on the above equations. Let us consider 
a mutation R Y which involve a transition AJn = —1 (case Ri > Yr); 
as of the considered transition also entails AJ3 = —1, we have to apply the 
operator J_, as well as the operator Ai. Of course, first we have to lower by 1 
the value of J3, then to modify J^, otherwise the initial state may eventually 
be annihilated, even if the transition is allowed (in the case — 1 < J3). 
Likewise, in corrispondence of a transition Y ^ R (AJ3 = +1), first the 
change + 1 has to be maked, then J3 — J3 + 1. If we want to write 

a self-adjoint operator, we have to sum the operator which gives rise to the 
transition Y ^ R with that one which leads to R ^ Y, leaving the rest of 
the string unmodified, so 

AJ^ + J+A\. (11) 

This operator leads to the mutation Y ^ Roi R ^Y for a nucleotide in i-th 
position, in a string with Ri > Yr. If the mutation R ^ Y corresponds to a 
rising of (i.e. a transition with AJ^r = 1, AJ3 = —1, case Ri < Yr), first 
has to be modified, then J3; therefore we write the self adjoint operator 

J.Al + AmJ+ (12) 

The above operator gives rise to mutations R ^ Y and Y R for a nu- 
cleotide in i-th. position, preceding the m-th one, in the case Ri = 0,Yr 0. 

Let us remark that eq.® is included in eq.©, if the coupling constants a 
are assumed equal; in eq.®, only the writing order for A^+i (and its adjoint) 
and J± has to be respected. Assuming now that the coupling constants do 
not depend on i, k, m, we can write that transition hamiltonian Hj as 

Hj = fi, {H^ + H^)+ fi2Hi + fi3H2 + fi^He (13) 

We let the fenomenology suggests us the scale of the values of the coupling 
constants of Hj. We want to write an interaction term which makes the mu- 
tation in alternating purinic/pyrimidinic tracts less likely than polipurinic 
or polipyrimidinic ones. We mean as a single nucleotide mutation in a 
polipurinic (polipyrimidinic) tract, a mutation inside a string with all nu- 
cleotides R (Y), i.e. a highest (lowest) weight state. Such a transition 
corresponds to the selection rules AJ^ = — 1,AJ3 = ±1, i.e. a transition 
generated by the action of H3 and ifs. In the interation term Hj, we give 
them a coupling constant smaller than the others terms. We introduce, for 
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AJ3 = ±1 , only four different mutation parameters //j (i = 1, 2, 3, 4), witfi 
III < Hk k > 1. 

1. Ill for mutations which change the irrep., AJ^ — ±1 , and include the 
spin flip inside an highest or lowest weight vector; 

2. ij,2 for mutations which do not change the irrep., AJ^ — , but 
modifies other values of J'', AJ^ = ±1 (2 < A; < AT - 1); 

3. /^a for mutations which do not change the irrep., AJ^ = 0, neither 
the other values of J^ AJ^ = 0, (2 < A; < iV - 1); 

4. /i4 for mutations which change the irrep., A J = ±1 , but only in a 
string with 7^ i?; < F^. 

We do not introduce another parameter, for mutations generated by H^, i.e. 
i-ih. nucleotide mutation in a string with i?/ > 1^ 7^ 0, to not distinguish, 
in a polipurinic string, between a mutation in 2-th position and another one 
inside the string. 

Let us emphasize once more that the proposed model takes into account, 
at least partially, the effects on the transition in the i-th site of the distribu- 
tion of all the spins. 

4 Results 

The total hamiltonian of the model will be written as H — Hq + Hj, where 
Hq is the diagonal part in the choosen basis and, in the following, is assumed 
to be Hq = 110J3. In order to evaluate the probabilities of transition, we 
cannot analytically study the time evolution of an initial state, representing 
a given spin chain, as ruled by hamiltonian H. So we look for a numerical 
solution, with a suitable choice of the value of the parameters, for N = 
3,4,6. Before solving numerically the model, it may be useful to point out 
explicitly its main features. Hq is the hamiltonian of a spin chain in presence 
of an uniform, constant, magnetic field (in the biological system it is the 
"fitness" assumed proportional to total purine surplus). Hj describes an 
interaction on the i-th spin neither depending on the position nor on the 
nature of the closest neighbours. Indeed it depends on the "ordered" spin 
orientation surplus on the left and on the right of the i-th position. Should 
it not depend on the order, it may be considered as a mean-field like effect. 
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Moreover AJ3 = ±1 transitions are allowed, which can be considered or as 
the flip of a spin combined with an exchange of the two, oppositely oriented, 
previous or following spins or as the collective flip of particular three spin 
systems, containing a two spin system with opposite spin orientations (see 
example below). Biologically, the transition depends in some way on the 
"ordered" purine surplus on the left and on the right of the mutant position. 
For example, the matrix form of if, on the above basis (for N = 3) is the 
following one, up to a multiplicative dimensional factor /iq 

/ -1 5 7 e 

(5 1 e 

-1 (5 e 

7 5 1 e 

e e -3 5 

e e 5 -1 

e e 6 

\ e e 

Note that the above hamiltonian depends only on three coupling constants 
due to the very short lenght of the chain. For > 4 the 4th coupling 
constant (denoted in the following by t]) will appear. Let us emphasize 
that the hamiltonian ()14|) does not only connect states at unitary Hamming 
distance. As an example, we look at the transitions allowed from | i, ^,0) 
(tit) and from | -i,i,0) (Tii) 

r TTT 

TiT^< lit Tii 
( Tii 

The transition probability between two states shows the typical quantum- 
mechanical rapidly oscillating behaviour in time. We deflne a time-averaged 
transition probability {i —>■ f) 

1 

<Pif>=- v^fit)dt (15) 

where the value of T will be numerically flxed to a value, such that the r.h.s. 
of eq. ()15|l is stable. For lack of space, we do not explicitly write the hamil- 
tonian which only induces transitions between chains at Hamming distance 
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equal to one, with coupling constant (3. Notice howevere that such a (Ham- 
ming) hamiltonian is not obtained by eq.(|14j) putting 5 = 7 = e = /9. If we 
order (in a decreasing way) the average transition probability from an initial 
state to every other state, we obtain, using the hamiltonian with Hamming 
distance, a rank ordered distribution of transition probability like that in 
figfflfor = 4. Its shape does not depend by the choice of initial state or by 
the coupling constant (3 value. We always get the same structure, for mod- 
els with transition probability only depending on Hamming distances. So 
the rank-ordered distribution of the average transition probability shows a 
plateaux structure: every plateaux contains spin sequences at the same Ham- 
ming distance from the initial one. In the case of the model which we propose 
here, i.e. the hamiltonian in (jl4p . the distribution of rank ordered average 
transition probability does not show a plateaux structure, but its shape is 
well fitted by a Yule distribution (figj^), like the rank ordered distribution of 
oligonucleotidic frequency in the strings of nucleic acids jH]. 

Let us observe that we obtain a Yule distribution (and not a plateaux 
structure) even if all parameters in (jl4|) are tuned at the same value, which 
means that the distribution is the outcome of the model and not of the 
choice of the values of the coupling constants. Analogous resultes are gained 
for = 4 (figEl) and iV = 6 (figlD), the state labelled by 1 being the initial 
one. 

As final remarks we point out that: 

i) our model is not equivalent to a model where the intensity depends on 
the site undergoing the transition, or from the nature of the closest neigh- 
bours or the number of the R and Y labels of the sequence; indeed essentially 
the intensity depends on distribution in the sequence of the R and F; 

ii) the ranked distribution of the probabilities, not averaged in time, com- 
puted for several values of the time follows generally a Yule distribution law; 
for the highest value of A^, the distribution is equally well fitted by a Zipf 
law, but not for the lowest values of A^, in agreement with the remark of [H]. 

In conclusion we are far from claiming that our simple quantum mechan- 
ical model explains the observed oligonucleotide distribution for several obi- 
ous reasons. Apart from the extremely simplifying assumption made (initial 
state as a pure state) and from the fact that we are really comparing different 
data, the experimental ones being derived from the splitting in short oligonul- 
ceotide sequences of several far longer sequences, we are relating quantities 
computed in a quantum world to classical observables. In any way we believe 
that the proposed model exhibits intriguing and interesting features, hinting 
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in the right direction, which are worthwhile to be further investigated, in 
particular either to make more clear the connection quantum model-classical 
quantities either to reformulate our transition matrix in order to be inserted 
in a classical master equation. It is worthwhile to remark that we are try- 
ing to compare theoretical results, deriving from a simple model (depending 
only on 4 parameters for any N), to really observed data, coming from the 
extremely complex biological world. More details and further developments 
will be presented in a longer paper. 
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Figure 1: Rank ordered distribution of time- averaged transition probability 
from the initial status |t| for a dynamics generated by Hamming hamilto- 
nian, with (3 = 0.5. The first point is out of the graph. 




Figure 2: Rank ordered distribution of time-averaged transition probability / 
from the initial status tt| for a dynamics generated by H with e = 0.1, 7 = 
5 = 0.3. The distribution was fitted by a Yule function (continuous line) 
/ = aR^h^ {R is the rank). The parameters was estimated as a = 1.96, b = 
0.24, k = -1.49. 
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Figure 3: Rank ordered distribution of time averaged transition probability 
/ from the initial status jiti for ^ dynamics generated by H, with e = 
0.1, 7 = 5 = ?7 = 0.5. The distribution was fitted by a Yule function 
(continuous line) / = aR^b^ [R is the rank). The parameters was estimated 
as a = 0.60, b = 0.72 k = -0.64. 
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Figure 4: Rank ordered distribution of time averaged transition probability 
/ from the initial status TiTiTi ^ dynamics generated by H, with e = 
0.1, 'y = 6 = rj = 0.5. The distribution was fitted by a Yule function 
(continuous line) / = aR^b^ {R is the rank). The parameters was estimated 
as a = 0.21, b = 0.95 k = -0.66. 
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